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Abstract. Let W = {W n '■ n 6 N} be a sequence of random vectors in M d , d > 1. This paper 
considers the logarithmic asymptotics of the extremes of W, that is, for any vector q > in R d , we 
find 

logP (3n S N : W n > uq) , as u — > oo. 

We follow the approach of the restricted large deviation principle introduced in Duffy et al. [8]. That 
is, we assume that, for every q > 0, and some scalings {a n },{v„}, -J^logP (W n /a n > uq) has a, 
continuous in q, limit J-w(q). We allow the scalings {a n } and {v n } to be regularly varying with a 
positive index. This approach is general enough to incorporate sequences W, such that the probability 
law of W n /a n satisfies the large deviation principle with continuous, not necessarily convex, rate 
functions. The formula for these asymptotics agrees with the seminal papers on this topic [3, 6, 8, 9]. 



1. Introduction 

Let W = {W n : n € N} be a sequence of random variables taking values in R. Define Q = sup„ >0 W n . 
The random variable Q has been extensively studied: if W is time-reversible, then Q has the same 
distribution as the steady-state workload distribution in a queue with free process W (see, e.g., [13]); 
Q has also various relations with finance and insurance risk. It is in general hard to determine the 
distribution of Q. One could therefore settle for the less ambitious goal of identifying the corresponding 
tail asymptotics, that is, finding a function /, such that P (Q > u) ~ f{u), as u — > oo (i.e., the ratio of 
the two tends to 1 as u — > oo). This, however, requires to impose a quite restrictive structure on W, 
even in the Gaussian setting. Therefore, one usually resorts to determining the logarithmic asymptotics 
of the (right) tail of the distribution of Q. It has been observed that, in great generality, 

(1) logP(Q > it) = logP(3n £ N : W n > u) ~ logsupP(W / Il > u) , as u -> oo. 

n>0 

The heuristic behind this claim is the ■principle of the largest term: rare events occur in the most likely 
way. That is to say, if W is unlikely to ever reach level u, then conditional on W in fact reaching u, with 
overwhelming probability this happens near to the most likely epoch. 

Using (1), the tail behavior of Q can be derived from the large deviation behavior of W: this was orig- 
inally proposed by Kesidis et al. [10] and later made rigorous by Glynn and Whitt [9]. More formally, let 
A be the limiting cumulant generating function (CGF) of W , that is A(8) = lim rl _ ) . 0O n -1 logEcxp(#W / rl ), 
when the limit exists, and let A satisfy the assumptions of the Gartner-Ellis theorem. Then, by its virtue, 
the sequence of probability measures {/i n : n £ N}, where fi n is the law of W n /n, satisfies the large de- 
viation principle (LDP) with rate function A*, the Fenchel-Legendre transform of A (also known as the 
convex conjugate of A); see, [4] for background on large deviations theory. Under these assumptions, [9] 
asserts that 

(2) lim - logP(3n E N : W n > u) = — sup{(9 : A(0) < 0}. 

u— >oo u 

Owing to its generality, this result is useful in a broad range of applications. 

The result in [9] has been extended in a notable paper by Duffield and O'Connell [6]. The authors 
consider the logarithmic asymptotics of the tail of Q, by imposing assumptions on random variables 
W n /a n , where {a n : n £ N} is some (non-necessarily linear) scaling. It is assumed that the scaled limiting 
CGF of W, defined as A(9) = linin^oo v~ l log Eexp((9v„ W n /a n ) for some sequence {v n : n £ N}, exists as 
an extended real number. The considered class of admissible scaling sequences is quite broad. It includes 
the case when a £ Mi / (A), v £ M^{V) are two regularly varying sequences with indexes A,V > 0; this 
class is broad enough for most of the applications. Considering non-linear scalings allows to incorporate, 
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for instance, long/short range dependent sequences corresponding to, for example, fractional Brownian 
motion. Similarly to [9], it is assumed that A meets the assumptions of the Gartner-Ellis theorem. 
Consequently, the sequence of probability measures {/i„ : n G N}, where fj, n is the law of W n /a n , satisfies 
the LDP with speed v n and rate function A*. The main result from [6] states that, under some additional 
assumptions, 

(3) lim —— logP (3n G N : W n > it) = - inf C - v/A A*(c), 

«->oo h(u) c>0 

where h = v o a -1 and a^ 1 denotes the right inverse of a. It can be verified that (2) follows from (3) in 
case of the linear scaling: a n = v n = n. 

Both [6, 9] impose additional conditions on W only to infer that the sequence of probability measures 
{/i n : n € N}, whether fj, n is the law of W n /n or W„/a n , satisfies the LDP with some well-behaved 
rate function. By exploiting the Gartner-Ellis theorem, the considered class of possible rate functions 
is limited to convex functions, whereas, in general, it could be any lower semi-continuous function. 
Alternatively therefore, one could assume that the LDP holds without any knowledge of how it was 
inferred. Duffy et al. [8] suggested a variant of this approach that allows to consider scaled sequences 
W with nonconvex rate functions. More formally, the authors proposed the restricted large deviation 
principle (RLDP) instead of the classical LDP. That is, they required that the limit 

(4) lim — logP(Wn/a„ > c) = -J w {c) 

n— s-oo v n 

exists for every c > and the function Jw is continuous on the interior of the set upon which it is 
finite. Thus, if the LDP holds with a continuous (where finite) rate function I w (and speed v n ), then the 
RLDP holds with Jw(c) = mi x > c Iw{x). Nevertheless it is not required that Iw has been inferred from 
the Gartner-Ellis theorem, nor, in principle, that the LDP holds at all. Observe that Jw does not give 
any information about the behavior of W for negative values so that in general the LDP is not even a 
prerequisite for the RLDP to hold. The main result from [8], under some additional assumptions, reads 

(5) lim — !— logP (3n G N : W n > it) = - inf c~ v/A J w (c). 

■tt-S-oo h(u) c>0 

It can be verified, that if Jw{c) = inf x > c Iw{x) and Iw is convex, then (5) reduces to (3) in the special 
case when Iw = A*. 

Note that (4) is a statement about the limiting behavior of P (W n /a n > c) with n growing large. This 
condition does not extract any information about W for specific values of n, in particular the initial values 
of W. The distribution of Q however, and hence the asymptotics as well, does involve the whole sequence 
W. It is therefore possible that the asymptotics of a single W n could dominate those of Q. For instance, 
one can greatly alter Q by simply substituting Wq with a properly chosen heavy-tailed random variable 
Wq. To exclude such a scenario, Duffy et al. [8] introduced an additional, novel assumption referred 
to as the uniform individual decay rate hypothesis; see Section 2.2. Roughly speaking it prevents the 
sequence W from having an "unusual" behavior for a single W n . In fact, [8, Section 4] points out that 
this issue was overlooked by [6]. That is, in order for the results from [6] to hold, one actually needs to 
impose further conditions. In the light of the generality of the result by Duffy et al. [8], their paper can 
be treated as the most up to date treatment of the subject of logarithmic asymptotics for the supremum 
of a stochastic sequence. 

In this paper we generalize and extend the result from [8] in multiple ways. Firstly, in Theorem 4 we 
show that, under the assumptions of [8], the sequence of probability measures {fJ^ : u G R+}, where 
(A) = P (3n G N : W n G uA), satisfies the LDP with speed h(u) and rate function Iw, such that 
Iw{x) = x y l A , for x > 0, and Iw(%) = o°, for x < 0. In particular, for any A G fi(R+), 

(6) lim — - logP (3n G N : W n G uA) = — inf x v ' A inf c~ v l A J w {c). 
u— s-oo h(u) ieA oo 

One can see that (6) extends (5) by setting A = (1, oo). Theorem 4 is presented in Section 3.1. 

Furthermore, in Section 3.2 we allow the sequence W to take values in K d for any d > 1, rather 
than just R. As it turns out, this multidimensional setting imposes substantial additional challenges, as 
compared to the single-dimensional setting. Regarding notation, to explicitly distinguish the multidi- 
mensional case from the one-dimensional counterpart we will make use of the usual boldface fonts. That 
is, we write x for the vector x = {x\, . . . , x<j), where the dimension d should be clear from the context. 
All vector relations should be understood coordinate- wise; for instance we write v > w to mean Vi > Wi, 
for all i = l,...,d. With this notation, we consider a sequence W = {W n : n G N} of random vectors 
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in R d . W is assumed to satisfy multidimensional analogues of the assumptions from [8]; see Section 2.2. 
In particular, it is assumed that the (multidimensional) RLDP holds, that is, the limit 

lim — logF (W n /a n > q) = -J w (q), 

n— >oo v n 

exists for any q > 0. Our second contribution, Theorem 5, states that, for any vector q > 0, 

(7) lim — - logP (3n £ N : W n > uq) = - inf C - v/A J w (cq). 

n-*oo hiu) c>0 

Obviously (7) is a generalization of (5) in the multidimensional sense. 

Another significant contribution is that we also discuss various relations and connections with the 
existing literature. In Section 4.1 we present the relation between the RLDP approach, as undertaken 
here and in Duffy et al. [8], and the approach via the cumulant generating functions, as undertaken in 
[6, 9]. In Section 4.2, we discuss various results by Collamore [3], who considered a sequence of random 
vectors {Y n : n £ N} in R d , such that the sequence of probability measures {/i„ : n £ N}, where 
fi n corresponds to the law of Y n /n, satisfies the LDP with a convex rate function. He proved various 
LDP-like statements for the sequence of probability measures {/j%' n : u £ R+}, where /j%' N (A) = 
P(3n > N : Y„ £ uA). His results, not referred to in [8], coincide with (5)~(7) in the case of N = 1, 
linear scaling and convex rate function. We provide a discussion of these results also in Section 4.2. 

We conclude our paper in Section 5, where we present an extension of Theorem 5 from sequences 
{W n '■ n 6 N} to stochastic processes {Wt '■ t 6 R+}. Furthermore, we apply our results to two 
examples. In the first one we treat heavy tailed processes which exhibit nonconvex rate functions, an 
example that was not covered by results that were known so far. In the second example, we compare 
our results with [5], which addresses a similar problem for the case of W being Gaussian. 

2. Preliminaries 

In this paper we use the following notation. For a function / : R d — > R we denote its domain by 
T>j = {x : f(x) < oo}. As already introduced in Section 1, we shall work with the following two 
functions a, v : R+ — > R+, which throughout the whole paper are assumed to be regularly varying 
functions at infinity with positive indexes A and V, respectively; we write a £ 3$Y{A), v £ ^^(V). It 
is well known that, for any regularly varying function / £ &Y{F) with a positive index F, it is possible 
to construct a strictly increasing and continuous function /' such that 

Um m = lim « =lim /^) =c , 

t->oo f'(t) t^oo f(t) *->oo f(t) 

Therefore, without loss of generality, we assume that both a and v are continuous and strictly increasing. 
We shall also speak about regularly varying sequences a n and v n defined by a n = a(n) and v n = v(n). 
Let a -1 denote the inverse of a. Define a new function h : R + — > R + by h = v o a^ 1 . The function h 
belongs to the class MY{y/A). For details on regular variation, see, e.g., [2]. 

For any subset A of R d we denote its cone by cone(^4) = {Ace : x £ A, A > 0}, its closure by A 
and interior by A° . For any convex function / on M. d we define its Fenchef-Legendre transform /* as 
f*(x) = sup a6Rrf {{a, x) - /(a)). 

2.1. Large deviations theory. We follow the definitions and setup as used in Dembo and Zeitouni [4]. 
All probability measures in this paper are assumed to be Borel measures. The function I : R d -> [0, DC 
is called a rate function if I is lower semi-continuous and I ^ oo. We say that / is a good rate function if, 
in addition, the level sets Jzf a I = {x : I(x) < a} are compact for each a > (which in fact is equivalent 
to the level sets being bounded as a function / is lower semi-continuous if and only if its level sets are 
closed). A sequence of probability measures {fi n : n £ N} on R d is said to satisfy the large deviation 
principle (LDP) with rate function I if, for all T £ £>(R d ), 

(8) — inf I(x) < liminf — log/i„(r) < lim sup — log fi n (T) < — inf I(x). 

x€T° ri-s-oo n n-yoo n x£T 

Consider the empirical mean Z n = — J^ILi °f a sequence of i.i.d. random vectors {X n : n £ N} in 
R d . Define fx n as the law of Z n and let Ax = logEe^ a,Xl ^ be the cumulant generating function (CGF) 
associated to the law of X\. In this case the classical theorem of Cramer applies. 

Theorem 1 (Cramer's theorem). Assume that £ T^\ x , then : n £ N} satisfies the LDP on M. d 
with good rate function A x . 



Now consider a general sequence of random vectors {Z n : n £ N} in M. d , and let fj, n denote again 
the law of Z n . The CGF associated with the law fj,„ is defined as A„(a) = logEe^ a,z "^ . Let A(a) = 
limsup,^^ iA n (na) be the limiting CGF. With this notation, the following well-known theorem holds. 

Theorem 2 (Gartner-Ellis theorem). Assume that £ T>\ and A is an essentially smooth, lower semi- 
continuous function. Then {[i n : n £ N} satisfies the LDP on R d with good rate function A*. 

Note that if Z n = — Y^i=i -^i as m the setting of Cramer's theorem, then A = Ax and further 
regularity conditions are not required. 

One can consider LDPs with the so-called speed {s n : n £ N}, when 1/n in (8) is replaced by l/s« — > 0. 
Then, the Gartner-Ellis theorem remains valid if A(ct) = limsup„_ >00 — A„(s„a), the scaled limiting 
CGF, satisfies the assumptions. All results of this subsection carry through to continuous parameter 
families {fjL u : u £ R+}. 

2.2. Main assumptions. In this paper we consider sequences W = {W n : n £ N} of random vectors 
in R d , d > 1, satisfying the following assumptions. 

Restricted LDP Hypothesis. There exists a function J\\r : R+ — >■ [0, oo] such that, for every q > 0, 
(9) lim — logP ( — ^ > q = -J w (q). 

Remark 1. If the sequence of probability measures {/i„ : n £ N}, where \i n denotes the law of W„/a„, 
satisfies the LDP with speed v n and rate function Iw , which is continuous where it is finite, then it also 
satisfies the RLDP hypothesis with Jw(<l) = inf x >q Iw{x) (hence the name of the hypothesis). If / is 
not continuous then it is easy to construct an example in which the restricted LDP does not hold. The 
opposite implication is also not true in general, that is, the restricted LDP hypothesis does not imply 
the LDP: property (9) does not provide any information about the negative values of W n /a n . 



Stability and continuity hypothesis. Jw(0) > and there exists y > such that J\v(y) < oo. 
Furthermore, Jw is assumed to be continuous on "Dj . 

In the queueing context, Jw(0) > is the usual stability condition. If Jw(x) = oo for all x £ R 1 ^ \ {0}, 
then F(3n £ N : W n > uq) will have superexponential decay. 

The restricted LDP hypothesis refers to limiting behavior of logP(W n > ua n q) for large n, not values 
for specific n. The asymptotic of a single W„ could dominate those of ¥(3n £ N : W n > uq). The 
condition below excludes this possibility. 

Uniform individual decay rate hypothesis. For a fixed vector q > 0, there exist constants F = 
F(q) > V/A and K = K(q) > so that for all n and all c > K, 

J-lo gF (Ejl >cq ] < _ c *\ 
v n \ a n J 

Remark 2. The restricted LDP hypothesis, the stability and continuity hypothesis and the uniform 
individual decay rate hypothesis were originally introduced in [8] in the one-dimensional case. That is, 
if one sets d = 1, then all the above hypotheses reduce to those of [8]. The hypotheses presented above 
can therefore be seen as natural extensions of the hypotheses from [8] to the multidimensional setting. 

Theorem 3 (One-dimensional case, Theorem 2.2, Duffy et al. [8]). If the sequence W = {W n : n £ N} 
of random variables satisfies all the hypotheses of this subsection, then 

(10) lim — !— logP (3n £ N : W n > u) = - inf C - v/A J w (c). 

■tt-S-oo h(u) c>0 



3. TWO EXTENSIONS 

In this section we present two generalizations of Theorem 3. Firstly, we consider families of measures 
/i„ (A) = P (3n £ N : W n £ uA) for a general set A; one easily sees that Theorem 3 considers the case of 
A = (1, oo). Secondly, we consider the situation when the sequence W takes values in M. d , d > 1, rather 
than just R. 
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3.1. Extension to the LDP. Define a lower semi-continuous function Iw by 

f , s _ ( oo for i<0, 
W{X) ~ { k Jw x v ' A for *>0, 

where kj w = inf c> o c~ y l A Jw(c) is the constant appearing on the right-hand side of (10). In this 
subsection we assume that the sequence of random variables W = {W n : n e N} is such that Wo = 0. 
This assures that Q = sup n>0 W n is a non- negative random variable. Let ^ be a probability measures 
on K defined as fj^(A) = P (3n G N : W n G mA) . The following theorem is the main result of this 
subsection. 

Theorem 4. Under the assumptions of Theorem 3, the family {/j,™ : u G M+} satisfies the LDP with 
speed function h and good rate function Iyy ■ 

Proof. First notice that, for any k > 0, 

(11) lim -^-logf^((k,o )) = -I w (k). 
u->-oo h(u) 



Indeed, 



1 ^~..w fll . « _ h (ku) 1 



log/U„ ((fc,oo)) = — log^((l,oo)), 



so that (11) follows from Theorem 3 combined with the fact that h(ku)/h(u) tends to k v l A . 

Let r be any set in S(R). Now if for some y, inf se f Iw{x) = Iiv(y), then, for any 77 £ (0,y), the 
monotonicity of I\y on R + implies that 

ltfF)<tf(]y,<x>))<ltf«V-ri,oo)). 

Hence, by (11), 

lim sup— log/xJf(r) < -i w (y-i]). 

This combined with the continuity of 7^/ on R + implies the upper bound: 

lim sup — !— logP (3n G N : W n G «r) < — inf I w (x). 
u^oo n{u) ier 

If mi xe f. Iw{ x ) — or inf xg p 7w( x ) = OO; then the above bound holds trivially. 
Now if T° n M + = 0, then the lower bound 

lim inf — — logP (3n G N : W n G uT) > — inf I w (x) 
u->oc h(u) ier° 

holds immediately. Therefore, let r° C R+ and s G T , r\ > be such that (s — 77, s + ??] C T . Obviously, 
Atjf( r ) > A*r((s -V,s + J?]) = ^(( s ~ f],oo)) - /i^((s + ??,oo)). From (11), for sufficiently large u, 
^((s + V, 00)) < A^f (0 - «7. o°))/ 2 - Hence, 

liminf -L log^(r) > liminf -L log = _f w{s j > _|^ (s) . 

Now the lower bound follows after optimization over s G r°. □ 

Remark 3. The assumption that Ho = is natural and fulfilled in many applications. If it does not hold, 
however, then the LDP from Theorem 4 remains true on 6(K+). In both of these cases, the continuity 
of I\y on K + implies that for any set A G S(R+), 

lim — !— logP (3n G N : W n G uA) = — inf I w (x). 



Hence, Theorem 4 generalizes Theorem 3, which can be seen by taking A = (1, 00). 
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3.2. Extension to the multidimensional case. In this subsection we generalize Theorem 3 to the 
multidimensional case. Theorem 3 itself serves as a building block in the proof of the following theorem. 



Theorem 5 (Multidimensional case). If the sequence W = {W n : n £ N} of random vectors in M. d , 
d > 1, satisfies the hypotheses of Section 2.2, then, for any q > 0, 

lim —— logP {3n £ N : W n > uq) = - inf c~ v/A J w (cq). 
m— s-oo h(u) C>0 

Proof. Notice that P(lf„ > uq) = P (Z n > u), where Z = {Z n : n £ N} is a sequence of random 

variables such that Z n = minj=i d{W n ^/qi). The sequence Z satisfies the assumptions of Theorem 3 

with a function Jz given by J zip) = Jw(cq). Indeed, by the restricted LDP hypothesis, it follows that 
for every c > 0, 

lim — logP f — > c ) = lim — logP f — - > cq] = -J w (cq). 

n-Kx V n \ a„ J n^>oo v n \ a n J 

The stability and continuity hypothesis for Jz easily follows from the stability and continuity hypothesis 
for Jw ■ The same applies to the uniform individual decay rate hypothesis. Therefore, Theorem 3 yields 

lim — !— logP (3n £ N : W n > uq) = lim -j— logP (3n 6N:Z„>ii) 

U-KX) h(u) ti->oo h(u) 



= - inf c- v/A J z (c) 



c>0 



= - inf c- v/A J w (cq). 

c>0 

This completes the proof. □ 



4. Connections with existing literature 

We have already discussed the relations of our results to [8]. In this section we shall discuss our 
findings in the light of already existing results from [3, 6, 9]. 

4.1. The cumulant generating function approach. The analyses in [6, 9] are based on the CGF. 
Both of them consider the case of d = 1, but only [6] allows for non-linear scaling. In this subsection we 
present conditions under which the main assumptions of the present paper are fulfilled. Recall that, 

A n (a)=togEaq>r/a ) ^M 

is the CGF of the law of W n /a n . Here it is assumed that A„ exists as a finite real number for all 
a E R. d and all n £ N. The hypotheses of our paper have simple expressions in terms of the CGF. 
The conditions we specify here for the CGF case, based on their one-dimensional analogues in [8], are 
intended for easy applicability, rather than maximum generality. Under these assumptions, the large 
deviation rate function is convex. The CGF technique is not applicable to models which have nonconvex 
rate functions. 

LDP hypothesis, CGF case. For each a £ M. d , the scaled limiting cumulant generating function 
A(a) = lim„_ i . 00 A n (v n a.) exists. Furthermore, A is assumed to be continuously differentiable. 

Under the above assumption, by the Gartner-Ellis theorem, the sequence of probability measures {//„ : 
n £ N}, where fi n is the law of W n /a n , satisfies the LDP with rate function Iw = A* and speed v n . 
This implies that Iw is convex and continuous on the set where it is finite and therefore the RLDP 
holds with Jw(q) = inf x >q A* (a;), which is also continuous on T>°j . Hence, in order to assure that the 
stability and continuity hypothesis holds, one can require the following. 

Stability hypothesis, CGF case. There exists ex.* > such that A(a*) < 0. 

The above hypothesis implies that Jw(0) > — A(a*) > 0. Indeed, 

Jw(0) = inf sup ((a, x) — A(a)) > sup inf ({a,x) — A(a)) = - inf A(a) > — A(a*). 
x >°aeR d a m d + x ^° «e R + 

Uniform individual decay rate hypothesis, CGF case. There exist constants F' and M such that 
F' > ma,x{V/A, 1} and ±A n (v n a.) < M]|a|| F '/(^'-i) f or all a > and all n £ N. 
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Under this hypothesis, for each F € (1,F'), there exists a constant Kp = Kp(q) such that, for all 
c > Kp, and all n 6 N, 

J_lo g p(!?Jt < -c F . 

v n \ a n J 

That is the uniform individual decay rate hypothesis certainly holds. Indeed, an elementary consequence 
of Chernoff 's inequality is 

\og¥(W n > ca n q) < -v n (c(a,q) - — A n (v n a) ] , 
for any a > 0. It then follows that 

\ogP(W n > ca n q) < -v n (c(cL,q)-M\\a\\ F 'K F '-V) . 
Choosing a = (c(F' - l)\\q\\/{MF')) F '- l q, we have 

(12) logP(TV„ > ca n q) < -Vn(c\\q\\ 2 ) F ' (m 1 ~ f> F'~ F ' (F' - if- 1 ) . 

Since M and F' are constants, for each F G (max{V/A, 1},F'), there exists Kp = Kp(q) such that, for 
all c > Kp, the right hand side of (12) will be less than —v n c F . 

4.2. The LDP with a convex rate function. The purpose of this subsection is to discuss the differ- 
ences between the approach from [3] and the one in our paper. 

Collamore [3] considered a sequence Y = {Y n : n G N} of random vectors in M. d . The main assumption 
of his paper is that the sequence of probability measures {/i„ : n G N}, where \x n is the law of Y„/n, 
satisfies the LDP with a convex rate function I Y , such that J£qI y 9, that is I Y (x) = for some 
self Furthermore, it is assumed that (using the notation from [3]): 

HI': sup n>7V i logEcxp (a, Y n ) < oo for all a € J^foly and N greater than or equal to some No- 

Remark 4. HI' is a regularity condition on the sequence Y . It is satisfied if Y is, for example, the nth par- 
tial sum of an i.i.d. sequence of random vectors satisfying the condition given in Cramer's theorem, or the 
weaker condition given in [12]. It also holds when Y is a Markov-additive process satisfying the uniform 
recurrence condition (6.2) in [11]. In both these cases No can be taken to be 1. Y can also be a general 
sequence satisfying the conditions of the Gartner-Ellis theorem and (i) A(a) is finite in the neighborhood 
of each a £ J^oA; (ii) the level sets of A are compact; recall that A(a) = limsup^^^ i logEexp (a, Y n ). 

Theorem 6 (Collamore [3], Theorem 2.2). Suppose A is a general set in K d and HI ' and 

H2: for some S > 0, Ad cone(^) = 0, where % = {x : mf ye _5? / y \\x — y\\ < 5}; 
are satisfied. Then, for any N > A^o, 

liminf - logP (3n > A^ : Y„ 6 uA) > - inf I Y (x) 

u->oo u x£A° 

and 

limsup - logP (3n > N : Y n G uA) < - M_I Y (x), 
where Iy(x) = sup cte ^ ?o j^ r (a,x) is the support function of J£qI y ■ 

Remark 5. Condition H2 is an admissibility condition on sets A. Recall that J^foly is the set of all the 
points y for which I Y (y) = 0. Intuitively, these are the points of the typical behavior of Y. Recall 
also that in the setting of Cramer's theorem, when Y n = X% + . . . + X„ , for an i.i.d. sequence of 
random vectors {X n : n £ N} in R d , Jf?oI Y = {KXi}. Therefore, the set % can be thought of as the 
5- neighborhood of all such points of typical behavior of Y, and thus A can be any general set that avoids 
the "central tendency" cone(^) = {Xx : A > 0, x G of Y. 

The straightforward major differences between the current approach and the one from [3] are the 
following. [3] considers the multidimensional case and sets satisfying H2, but only allows linear scaling. 
The sequence Y has to satisfy HI' and the sequence of measures corresponding to Y n /n, the LDP 
with convex rate function. In our setup we considered the multidimensional case and regularly varying 
scalings, general sets in the case of d = 1, but only quadrants {x G M. d : x > q], for any q > 0, in 
the case of d > 1. Furthermore, we do not require the LDP to hold, but impose the restricted LDP 
hypothesis allowing for continuous rate functions. We have already explained that non-linear scalings 
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allow to incorporate, for instance, long/short range dependent sequences stemming from, for example, 
fractional Brownian motion. Also, as explained in Remark 1, it is possible that the restricted LDP holds 
when the LDP does not and vice versa. 

Observe that, if Iy- is continuous where finite, then the restricted LDP holds with Jy(q) = inf x >q Iy (x). 
By [14, Theorem 13.5], Iy(x) is equal to the closure of L{x) = inf T> o T~ x ly (tx), that is, the greatest 
lower semi-continuous function majorized by L. Furthermore, if A = {x £ M. d : x > q}, for some q > 0, 
satisfies H2, then the upper and the lower bound in Theorem 6 are equal to 

inf Iy(x) = inf inf r _1 Iy{tx) = inf r _1 inf Iy(tx) = inf t^ 1 Jy(Tq). 

x>q x>qr>0 r>0 x>q r>0 

Hence, if, in addition, we are allowed to take No = 1, then Theorem 6 coincides with Theorem 5 in the 
case of linear scaling and convex rate functions. 

5. Examples 

In this section we discuss some of the examples in which the theory of this paper can be applied. 
Firstly, we shall consider an example that does not fit in frameworks of any of the previous paper. This 
is due to its multidimensional nature and the fact that the rate function appearing there is nonconvex. 
Secondly, we shall consider a multidimensional Gaussian example and shall try to recover some previously 
known results. 

Let us begin with discussing an extension of Theorem 5 from sequences {W n : n £ N} to stochastic 
processes {Wt ■ t £ K+}. To this end we formulate an additional necessary hypothesis. 

Extension hypothesis. 

logP(3t 6 (n,n + 11 : W t > uq) ,. logP (W n >uq) 
hm sup — — = hm sup — — — 

«->oo IlgN h(u) «->oo neN h(u) 

The above hypothesis was also introduced in [8] in the one-dimensional case. Therein, it is argued that 
under this hypothesis, for the case of d = 1, Theorem 3 extends from sequences {W n : n £ N} to processes 
{Wt ■ t £ It is straightforward to conclude from the proof of Theorem 5 that this is also the case 

if d> 1. 

5.1. Application to heavy tailed processes. Let us consider processes of the type described in Duffy 
et al. [8, Section 3.2]. To this end, we first introduce the heavy tailed distribution H by 

f(n > x) = l(x)e- v{x) , 

where I is a slowly varying function and v £ MYiV) with V £ (0, 1). 

Now consider a continuous time process {Y t : t £ R + } taking the values and 1, with the times spent 
in the and 1 states being a sequence of i.i.d. random variables with the same distribution as W. For 
c > 0, define a process {Z t : t £ M.+ } via 

(13) Z t = [ (Y s - c)ds. 

Jo 

It was shown by Duffy [7] that the family of probability measures : t £ where fit is the law of 

Zt/t, satisfies the LDP with speed function v(t) = t v and the (nonconvex!) rate function 

f (l~2(x + c)) v ifx£ [-c,i-c], 

(14) I c (x) = < (2(x + c) - l) v if x£ [\ -c,l-c], 

[ oo otherwise. 

Now consider two independent processes {Y^ : t £ M + } and {F t 2 : t £ M+} defined as above and 
construct the corresponding processes Z 1 = {Z\ : t £ M + } and Z 2 = {Zf : t £ R+} via (13) with 
constants c\ > and C2 > 0, respectively. Now let Z = {Z t ■ t £ K.+ }, where Z t = (Z\, Z\). Note that 
the rate function Iz corresponding to Z is given by Iz{x) = I Cl (xi) + I C2 {x2), where I Cl and I C2 are given 
by (14). Finally, define a new process W = {W t : t £ M + } via (W t \ W t 2 ) = {z\,Z\ + Z}). According to 
the contraction principle [4, Theorem 4.2.1], the family of probability measures {fit '■ ^ G ^+} on 
where fi^ is the law of Wt/t, satisfies the LDP with speed v(t) = t v and rate function Iw given by 

I w (x)= inf (I Cl (vi) + I C2 (v 2 )) = I ci (xi) + I C2 (x 2 - xi). 

v£R 2 :(v 1 ,V2) = (xi,X2—x 1 ) 

Hence W satisfies the restricted LDP hypothesis with rate function Jw{q) = hifcc>q I\v( x ), q>0. 

8 



Let q\ = qi + c\ and q-i = g2 + c\ + C2. Elementary calculus reveals that, with g > 0, 



Jw(g) 



lw(qi,qi + I - c 2 ) 

Av(<?2 + C 2 - 5,92) 

7w(gi,gi - c 2 + 1) 

00 



if gi G and g 2 6 (1, 

2-5; if g\ G (5,1] and g 2 G [| +gi,§], 
1 + I(g 2 - 1) if gi 6 (|, 1] and q 2 G [§, 1 + gi], 
1 + J(gi) if gi G (5, 1] and g 2 G [1 + gi,2], 
00 if qi > 1 or g 2 > 2, 



where J = Jo- Recall that ci,ci G (|, 1), thus we do not define Jw{q) for q\ < |, g 2 < 1. Note also 
that, if ci + c 2 G [f ,2), then g 2 > |, so that the first two cases in the definition of Jw do not occur. 
Finally, one can easily check that Jyy is continuous on the interior of the set where it is finite and that 
Jw(0) > 0. Hence the stability and the continuity hypothesis holds. 

The coordinates of W are bounded above (by 1 — c\ and 2 — c\ — c 2 respectively) , the uniform decay 
rate hypothesis is satisfied. Note also that, for every n and t G (n,n + 1], Wt = W n + R n ,t, where 
R n ,t = (Rl^Rlj) with both Rl t = f*(Yf - a) da and g* >t = fffi + Y 2 - ci - c 2 )ds bounded 
independently of t. This immediately implies the extension hypothesis too. Therefore, the extended 
version of Theorem 5 applies. It gives, for every q > 0, 



lim \ logP (Si G M+ : / (Y, 1 - Ci) ds > ugi, / (X, 1 + Y 2 - a - c 2 ) ds > uq 2 



Let 



inf 



Jw(tq) 



, then with self-evident notation, if ci + c 2 G [§, 2), 




gi 

l-C; 
92 



and if ci + c 2 G (1, |), with x A y = min(x,y) 

ft 1 ' ^ 
,2 



2|(l- Cl ) + 2( Cl + C2)- 
1+ [2|i(2- Cl -c 2 )+2 Cl -l 



91 



-inf Jw f g) . 

t>o t y 



91 > ba 

92 — 2 — Cl — C2 : 

91 < 

92 — 2 — ci— c 2 : 



inf hit) = { h 2 
t>o 



91 

|— ci— < 

92 

§— Cl— C 2 



A ft- 4 ( 2^£3-£2 
92 



1-Cl 
92 



91 > , 

92 — f — C1-C2 



C1-C2 

2 / l-Cl-C; 



91 



1 — ci 1 — ci 

2— ci— c 2 ' tt — ci- 



92 



A ft 4 ( 2 ' C1 ~ C2 

92 



92 

gi <- 1-ci 

92 — 2 — ci— c 2 



5.2. Application to Gaussian processes. In this subsection we consider an example from Debicki 
et al. [5, Section 3.2]. That is, let Y = {Y(t) : t G K+} be a centered Gaussian process in W l with 
stationary increments and covariance matrix S t = diag(citr 2 (t), . . . , Cd<J 2 {t)), so that the coordinates 
of Y(t) = (Yi(t), . . . ,Yd(t)) are independent and, for each i = l,...,d, Var(Y"i(t)) = Ci<j 2 {t), for some 
d > 0, and a 2 G where 7 G (0,2); compare these assumptions with assumptions C1-C3 of [5, 

Section 3.2]. 

For an invertible matrix S, define a new Gaussian process W = {W(t) : t G K + } in R d via W(t) = 
SY(t) - i(t), where i : R -> R d is such that i(t) = (t, . . . , t). Set a(t) = t and v(t) = t 2 a^ 2 (t) and notice 
that, 



At (a) = logEexp I ( a 



W(t) 



1 

= o ( 



a) - (a,i(l)>, 



a(t) /y 2 \ ' t 2 

so that the CGF variant of the uniform individual decay rate hypothesis holds with F' = 2. Furthermore, 
A(a) = lim J-A 4 («(t)a) = - (a, S*C5 T o;) - (a, t(l)> , 

i->cx> u(f) 2 

where C = diag(ci, . . . , cd), so that the CGF variant of the LDP hypothesis holds and the RLDP is 
satisfied with 

J w (q) = inf A*(x) = I inf (S'^x + »(1)), C^S'^x + *(1))> , 

a^>q Z ic^9f 

where the form of A* follows from Rockafellar [14, Theorem 12.3]. From this formula it follows that the 
stability and continuity hypothesis holds. Finally, by the stationarity of increments of Y, for any e > 0, 



(15) P (3t G (n, n + 1] : W(t) >uq)<¥ (3t G [0, 1] : W(i) > ueq) + P (W(n) > u(l - e)g) 



Now, for any x > 0, define a new Gaussian process via Z(t) = u^ q '\^ ■ Using Borell's inequality, see, 
e.g., [1, Theorem 2.1], 

(u - u) 2 

(16) logP (3t £ [0, 1] : W(t) > ueq) < logP (3t £ [0, 1] : Z(t) > u) < - y - ^ , 

2cH 

where fi = Esup te [ ji Z(t) and ct 2 = sup te r 01 i Wax(Z(t)). Combining (15) and (16) one can retrieve 
the extension hypothesis after proper optimization in e — > 0. Hence, the extended version of Theorem 5 
implies that, 

, , C7 2 (u) , 1 (S'- 1 (a; + i(l)),C- 1 S*- 1 (a; + *(1))) 
17 lim -At 1 logP (3t £ R+ : Wit) > uq) = -- inf inf ^ i V _ i 

u^oo u z 2c>Qx>cq C z T 

The change of variable c^f 1 gives 

(5- 1 (o ; + i(l)),C-^- 1 ( a; + i(l))) . , . , (S- 1 (a + i(c- 1 )),C- 1 5- 1 (a + i(c- 1 ))) 
mi mi = mi mi 

c>0x>cq C 2-7 c>0x>q C — 1 

(S-^x + i^^^S-Hx + iit))) 
= mi ml -. 

t>0x>q VI 

Therefore, (17) coincides with [5, Proposition 2]. 
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